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1. Title of the Invention 

IMAGE DATA COMPRESSION METHOD 

2. Claims for the Patent 

(1) An image data compression method for dividing 
input image data of one screen into a plurality of 
blocks, one block of p x p pixels, and encoding and 
compressing each divided block in real time by an 
encoding method of combining an orthogonal transform 
and a variable length coding, including: 

measuring the generated code amount for each 
block line for which the encoding is in progress, and 
controlling a code amount control parameter for each 
block line so that the code amount of one entire screen 
may be less than or equal to a predetermined target 
code amount, based on this measurement result. 

(2) The image data compression method according 
to claim 1, characterized by including controlling the 
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code amount control parameter so that the target code 
amount at each point in time for each block line for 
which the encoding is in progress may be larger in the 
central part of said image, and smaller at the upper 
end part and the lower end part of said image. 

(3) The image data compression method according 
to claim 1 or 2, characterized in that the code amount 
control parameter is a scale factor. 

3. Detailed Description of the Invention 
[ Industrial Application Field] 

The present invention relates to an image data 
compression method for compressing and encoding the 
still image data for transmission or recording, 
including controlling the code amount after compression 
and encoding to be less than or equal to a required 
code amount . 
[ Conventional Art] 

As a highly efficient compression encoding 
technique for the natural image, a method of combining 
the orthogonal transform and the variable length coding 
has been examined. As an encoding method for the color 
natural image, the method of this kind will be adopted 
as the international standardization method (refer to 
Journal for The Institute of Image Electronics 
Engineers of Japan: Vol 18, No. 6, P398 to P407). 

In the encoding method of this kind, the image 
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quality of decoded image and the code amount in 
encoding can be controlled by controlling a certain 
kind of parameter. The relationship between the image 
quality of decoded image and the code amount is such 
that as the code amount is larger, i.e., as the degree 
of compression is smaller, there is less degradation in 
the image quality from the original image, or 
conversely, as the code amount is smaller, i.e., the 
degree of compression is larger, there is more 
degradation in the image quality from the original 
image. However, in the encoding method of this kind, 
an adaptive process is performed using a local 
correlation of the image in encoding, whereby the 
highly efficient compression coding is enabled 
according to the image quality of object image. 
[ Problems to be Solved by the Invention] 

By the way, generally, the image of minute 
pictorial pattern has a greater code amount, and the 
solid, flat image has a smaller code amount. Therefore, 
in the encoding method as described previously, if the 
encoding is performed using the same code amount 
control parameter, the obtained code amount is 
different for each object image. Accordingly, the code 
amount generated by encoding can not be predicted in 
advance, whereby it is not easy to encode any object 
image to obtain a certain code amount. Particularly, 
the application of the natural image (still image) 
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encoding may be made so far to a still image television 
telephone, a facsimile, storage in an image database, 
or the like, whereby it is not necessarily required to 
encode any object image to obtain a certain code amount. 

However, if the encoding method of this kind is 
applied to the electronic still camera, a function 
capable of recording a certain number of images on a 
predetermined recording medium is required, and a 
technique for compressing and encoding each object 
image with a certain code amount is required. 
Especially in the consumer-oriented camera, the 
photographer needs to know how many more images can be 
recorded on the recording medium, whereby this 
technique is important. However, since the information 
amount contained in the image generally changes for 
each image as previously described, if all the object 
images are encoded with a definite code amount control 
parameter, there is inconvenience that the number of 
images recordable on the recording medium may change 
depending on the object images to be taken. 

It is an object of this invention to provide an 
image data compression method for controlling so that 
the code amount after compression and encoding may be 
less than or equal to the code amount required for any 
object image. 

[ Means for Solving the Problems] 

This invention provides an image data compression 
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method for dividing input image data of one screen into 
a plurality of blocks, one block of p x p pixels, and 
encoding and compressing each divided block in real 
time by an encoding method of combining an orthogonal 
transform and a variable length coding, including 
measuring the generated code amount for each block line 
for which the encoding is in progress, and controlling 
a code amount control parameter for each block line so 
that the code amount of one entire screen may be less 
than or egual to a predetermined target code amount, 
based on this measurement result. 

Also, the image data compression method includes 
controlling the code amount control parameter so that 
the target code amount at each point in time for each 
block line for which the encoding is in progress may be 
larger in the central part of the image, and smaller at 
the upper end part and the lower end part of the image. 
[ Operation] 

According to this invention, to make the code 
amount of one entire screen less than or equal to a 
predetermined target code amount, the average code 
amount up to that point in time of encoding is measured 
for each block line during the encoding, and the code 
amount control parameter is controlled so that the code 
amount may be less than or equal to the target average 
code amount up to the point in time of encoding. 

That is, if the average code amount exceeds the 
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target average code amount, the code amount control 
parameter is controlled so that the code amount after 
compression is decreased for the next block line, or 
conversely if the average code amount is less than or 
equal to the target average code amount, the code 
amount control parameter is controlled so that the code 
amount after compression may be increased for the next 
block line. However, if the average code amount is 
less than or equal to the target average code amount 
and falls within a permissible range, the current code 
amount control parameter is maintained for the next 
block line. 

In this case, the total target code amount may be 
changed halfway so that the target average code amount 
may be larger in the central part of the image, and 
smaller at the upper end part and the lower end part of 
the image, whereby more code amount is allocated to the 
central part to reduce a degraded image quality in the 
central part of decoded image. 
[ Embodiments] 

Figure 1 is a schematic diagram showing a 
procedure for an image data compression method 
according to one embodiment of the present invention. 

This embodiment includes dividing input image 
data of one screen into a plurality of blocks, one 
block of p x p pixels, for example, 8x8 pixels (step 
Tl), and performing a two-dimensional discrete cosine 
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transform (DCT) for each block (step T2) . The DCT is a 
kind of orthogonal transform. Supposing that the 
transformation coefficient is F uv (u, v = 0, 1, p- 
1), and input image data for one block is f ±j (i, j = 0, 
1, p-1) , F uv is defined by 

[ Equation 1] 

Where C(w) = 1/V2 (w = 0) 

= 1 (w * 0) 

The obtained transformation coefficient F uv 
indicates the component in which input image data of 
one block is decomposed into spatial frequencies. 
Among the transformation coefficients F uv , the 
coefficient F 0 o indicates the value (DC component) 
proportional to the average value of input image data 
fij over p x p pixels, in which as the variables u and 
v increase, the component (AC component) at higher 
spatial frequency is indicated. 

The p x p two-dimensional DCT coefficients F uv 
thus obtained are divided by respective thresholds of a 
quantization matrix composed of p x p thresholds, and 
quantized (step T3) . Figures 2 and 3 show the examples 
of the quantization matrix for brightness signal and 
color difference signal. Each threshold of the 
quantization matrix is multiplied by the factor 2 s (S = 
0, ±1, ±2, ...) as will be described later, whereby the 
code amount and decoded image quality are controlled by 
changing the quantization step width. 
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The differences between the DC components of the 
quantized DCT coefficients and the DC components 
quantized in the previous block are taken (step T4) . 
The number of bits for a difference is Huffman coded 
(step T5) . The AC component is converted into one- 
dimensional number sequence by performing the zigzag 
scan within the block according to a zigzag scan table 
as shown in Figure 4 (step T6) . The number of 
consecutive zero data is compressed by the run length 
coding (step T7). The two-dimensional Huffman coding 
is performed by the number of consecutive zero data and 
the number of bits in the effective coefficient 
obtained by the run-length coding (step T5) . The 
Huffman coded data is stored as variable length coded 
data in the recording medium such as an IC card. 

In this case, the code amount finally obtained as 
the variable length coded data can be controlled by 
changing each threshold of the quantization matrix in 
quantizing at step T3. In this example, each threshold 
of the quantization matrix is multiplied by the factor 
2 s and changed to control the code amount after 
compression and the quality of decoded image. Herein, 
the width S of the factor 2 s is called a scale factor, 
and the relationship with the code amount is shown in a 
table at the following page. 
[ Table 1] 

That is, if the scale factor S is increased, the 
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quantization step width is larger, and the code amount 
is decreased. Conversely, if the scale factor S is 
reduced, the quantization step width is smaller, and 
the code amount is increased. This qualitative 
relationship holds without regard to the object image, 
but the quantitative relationship between scale factor 
S and data amount is not uniquely determined, whereby 
the code amount obtained for each object image is 
different even if the data is encoded with the same 
scale factor S. 

Thus, in this embodiment, the average code amount 
up to the point in time of encoding is measured during 
the encoding, and the scale factor S is controlled so 
that the code amount is less than or equal to the 
target average code amount up to that point in time of 
encoding to perform a scale factor control (step T8) . 
That is, when the average code amount exceeds the 
target average code amount, the quantization step width 
is made larger by increasing the scale factor S, so 
that the code amount after compression is reduced. 
Conversely, if the average code amount is less than or 
equal to the target average code amount and small 
beyond the permissible range, the quantization step 
width is made smaller by decreasing the scale factor S, 
so that the code amount after compression is increased. 
If the average code amount is less than or equal to the 
target average code amount and falls within the 

- 9 - 



JPA04-018857 



permissible range, the current value of the scale 
factor S is maintained. 

The scale factor control (step T8) is performed 
by dividing one screen composed of plural m x n pixels 
as shown in Figure 5A into plural M x N blocks, one 
block of p x p pixels, as shown in Figure 5B, measuring 
the generated code amount for each obtained block 
column (hereinafter referred to as a block line) in the 
horizontal direction, and adjusting the scale factor S 
for the next block line based on the result. 

The details of the scale factor process (step T8) 
will be described below in which each data is defined 
as follows. 

• Number of pixels in one block: p x p 

• Number of blocks per block line: M 

• Number of block lines: N 

■ Average code amount per target pixel: BR 

■ Average code amount permissible width per 
target pixel: BRW 

• Target total code amount: TARCVOL 
Where TARCVOL = BR x M x N x p 2 

• Initial value of scale factor: S 0 

■ Scale factor control step width: AS 

• Generated code amount for each block line: 
CVOLBL (k) 

Where k is the variable indicating the block line 
number . 
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■ Scale factor for each block line: S(k) 

• Block line number of block line where the 
encoding process is in progress: CURBN 

■ Block line number at which the scale factor is 
changed immediately before the point in time when the 
encoding process is in progress: BEFBN 

• Accumulated code amount up to the point in time 
when the encoding process is in progress: CVO LAC (CURBN) 

Where 
[ Equation 2] 

• Residual target average code amount at the 
point in time when the encoding process is in progress: 
TARBR( CURBN) 

Where 
[ Equation 3] 

• Average code amount since the scale factor is 
changed immediately before up to the point in time when 
the encoding process is in progress: SMSFBR (CURBN) 

Where 
[ Equation 4] 

The scale factor S is controlled under the 
following conditions (a) to (c) , based on the above 
definitions . 

(a) If TARBR-BRW < SMSFBR < TARBR, S(CURBN+1) = 
S (CURBN) 

(b) If SMSFBR > TARBR, S(CURBN+1) = S (CURBN) +AS 

(c) If SMSFBR < (TARBR-BRW), S(CURBN+1) = 
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S (CURBN) -AS 

The condition (a) is the case where the average 
code amount is in an appropriate range of target value, 
in which the current scale factor value is maintained 
for the next block line. 

The condition (b) is the case where the average 
code amount is beyond the permissible range and greater 
than the target value, in which the current scale 
factor value is increased by a certain step width for 
the next block line to decrease the generated code 
amount . 

The condition (c) is the case where the average 
code amount is below the permissible range and smaller 
than the target value, in which the current scale 
factor value is decreased by the certain step width for 
the next block line to increase the generated code 
amount . 

Figures 6A and 6B are the graphs showing the 
behavior of the generated code amount when such control 
is performed. The abscissa represents the block line 
number in the encoding process, and the ordinate 
represents the scale factor S on the upper part and the 
accumulated code amount CVOLAC on the lower part. 
However, the scale factor S has the reverse sign of 
positive or negative. Also, Figure 6A is a graph 
showing the behavior up to the end of encoding, and 
Figure 6B is a graph minutely showing the state during 
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the encoding. From the graph of Figure 6A, it can be 
found that the accumulated code amount for the entire 
image is controlled to be less than or equal to the 
target value. 

The average code amount since the scale factor is 
changed immediately before up to the point in time when 
the encoding process is in progress is represented as 
an inclination of the straight line connecting the 
accumulated code amount (point a with ♦ sign in the 
graph) BEFBN at the point in time when the scale factor 
is changed immediately before and the accumulated code 
amount (point b with 0 sign in the graph) CURBN at the 
point in time when the encoding process is in progress 
on the graph of Figure 6B. Also, the target average 
code amount at this point in time is represented as an 
inclination of the straight line connecting the 
accumulated code amount (point b with 0 sign in the 
graph) at this point in time and the target accumulated 
code amount (point c with T sign in the graph) for all 
the block lines. 

Herein, the advantage of using the average code 
amount since the scale factor is changed immediately 
before up to the point in time when the encoding 
process is in progress as an observation amount for the 
control of the scale factor will be described below. 
For comparison, the other observation amounts for the 
control of the scale factor are given as follows. 
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(1) The total average code amount TOTALBR from 
the start of encoding up to the point in time when the 
encoding process is in progress, 

[ Equation 5] 

(2) The average code amount CURBR for only the 
block line at the point in time when the encoding 
process is in progress, 

[ Equation 6] 

In the case of using the total average code 
amount of (1) from the start of encoding up to the 
point in time when the encoding process is in progress, 
the behavior is less sensitive to a change in the 
generated code amount under the control of scale factor. 
This is shown in a graph of Figure 7. The average code 
amount of (1) is represented as an inclination of the 
straight line connecting the accumulated code amount 
(point a with ♦ sign in the graph) at the start point 
of encoding and the accumulated code amount (point d 
with 0 sign in the graph) at the point in time when the 
encoding process is in progress on the graph of Figure 
7. The graph of Figure 7 is the case where the scale 
factor at the point in time of starting the encoding is 
too small for the target code amount. But at the point 
in time c as indicated by O sign in the figure, though 
the average code amount for only this block line is 
already fully controlled for the target value, the 
total average code amount still exceeds the target 
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value, whereby the scale factor is changed. This 
appears as a local degradation in the image quality of 
the decoded image. In this manner, the excess control 
for the scale factor unfavorably has adverse influence 
on the image quality of the decoded image. 

On the contrary, in the case of using the 
accumulated code amount since the scale factor is 
changed immediately before up to the point in time when 
the encoding process is in progress, the scale factor 
is controlled with an inclination of the straight line 
connecting the point b as indicated • sign and the 
point c as indicated by O sign, whereby the excess 
control for the scale factor does not occur. 

Next, in the case of using the average code 
amount of (2) for only the block line at the point in 
time when the encoding process is in progress, the 
behavior is conversely too sensitive to a change in the 
generated code amount of the image. This is shown in a 
graph of Figure 8. The average code amount of (2) is 
represented as an inclination of the straight line 
connecting the accumulated code amount (point c with 0 
sign in the graph) at the point in time when the 
encoding process is in progress and the accumulated 
code amount (point b with ♦ sign in the graph) for the 
immediately previous block line on the graph of Figure 
8. Though the graph of Figure 8 is the case where the 
generated code amount locally increases during the 
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encoding, the average code amount for only this block 
line exceeds the target value at the point in time as 
indicated by 0 sign in Figure 8, whereby the scale 
factor is changed. Of course, the property of the 
image changes with this block line as a boundary, in 
which if the generated code amount is increased over 
the long term, it is required to control the scale 
factor, but if the generated code amount happens to be 
increased for several block lines, the change of the 
scale factor may not be required. In such a case, if 
the average code amount of (2) for only the block line 
at the point in time when the encoding process is in 
progress is used, the scale factor is necessarily 
changed. This is the excess control for the scale 
factor, as in the previous case (1), and unfavorably 
appears as a local change in the image quality of the 
decoded image. If the average code amount since the 
scale factor is changed immediately before up to the 
point in time when the encoding process is in progress 
is used, the scale factor is controlled according to 
the inclination of the straight line connecting the 
point a as indicated by • sign and the point c as 
indicated by 0 sign in Figure 8, whereby the excess 
control for the scale factor does not occur. 

Incidentally, in the ordinary photographed image, 
an important subject is often placed in the central 
part of the image. Therefore, the total target code 
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amount may be changed halfway so that the target 
average code amount is larger in the central part of 
the image and smaller at the upper end part and the 
lower end part of the image, thereby allocating a 
greater code amount to the central part. 

Figure 9 shows an example in which the upper one- 
fourth of the image is the upper end part, the lower 
one-fourth of the image is the lower end part, and the 
central one-half is the central part, whereby the code 
amount that is (1+a) times the total target code amount 
per pixel is allocated to the central part, and the 
code amount that is (1-a) times the total target code 
amount per pixel is allocated to the upper end part and 
the lower end part. In this case, the target total 
code amount TARCVOL is defined as follows. 

(a) When the block line number k is 0 < k < N/4 
(line a in Figure 9) , 

TARCVOL = INITTARBR x BR x (1-a) x M x N x p 2 

(b) When the block line number k is N/4 < k < 
3N/4 (line b in Figure 9), 

TARCVOL = INITTARBR x BR x (1+0. 5a) x M x N x p 2 

(c) When the block line number k is 3N/4 < k < N 
(line c in Figure 9), 

TARCVOL = INITTARBR x BR x M x N x p 2 
Where INITTARBR is an initial value (bit/pel) of 
the average code amount per target pixel. According to 
this embodiment, the target average code amount is made 
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larger in the central part of the image than the upper 
and lower end parts of the image, whereby it is 
possible to reduce a degradation in the image quality 
of the decoded image in the central part, and compress 
the entire image to the target code amount or less. 

In the above manner, the generated code amount is 
measured for each block line during the encoding, and 
compared with the average code amount since the code 
amount parameter is changed immediately before up to 
the point in time when the encoding process is in 
progress, and the compression ratio control parameter 
is adjusted every time, whereby the entire image can be 
compressed to the target code amount or less without 
causing a local degradation in the image quality of the 
decoded image. 

The control of the code amount may be made not 
only by using the scale factor, but also by changing 
the quantization matrix itself. 
[ Advantages of the Invention] 

With this invention, it is possible to provide an 
image data compression method for assuring the number 
of images that can be compressed and stored in a 
certain storage capacity without causing a local 
degradation in the image quality of the decoded image 
in encoding and storing a plurality of still images in 
real time by a combined method of orthogonal transform 
and variable length coding. 
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[ Brief Description of the Drawings] 

Figure 1 is a schematic diagram showing a 

procedure for an image data compression method 

according to one embodiment of the present invention; 

Figures 2 and 3 are tables showing two examples 

of a quantization matrix for brightness signal and 

color difference signal; 

Figure 4 is a table showing an example of zigzag 

scan; 

Figure 5 is a schematic screen view for 
explaining the operation of this invention; and 

Figures 6 to 9 are graphs showing how the 
generated code amount changes. 
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Figure 1 



Tl 



Blocking 



T3 



Quantization 



T4 



DC component difference coding 



T5 



Huffman coding 



T6 



Zigzag scan 



T7 



Run length coding 



T8 Scale factor control 

#1 Input image data 

#2 Quantization matrix 

#3 Scale factor 

#4 Compressed data 

#5 Procedure of this invention 

Figure 2 

#1 Quantization matrix for brightness signal 
Figure 3 

#1 Quantization matrix for color difference signal 
Figure 4 

#1 Table for zigzag scan 

Figure 5 
#1 One pixel 
#2 One line 
#3 m pixels 
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#4 n lines 

#5 Total screen m x n pixels 

#6 One block (8x8 pixels) 

#7 One block line 

#8 M blocks 

#9 N block lines 

#10 Total screen M x N blocks 

#11 Screen configuration view 

Figure 6 

#1 Block line number 
Figure 7 

#1 Block line number 
Figure 8 

#1 Block line number 
Figure 9 

#1 Block line number 
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